MetaAugment: Sample-Aware Data Augmentation Policy Learning

نویسندگان

چکیده

Automated data augmentation has shown superior performance in image recognition. Existing works search for dataset-level policies without considering individual sample variations, which are likely to be sub-optimal. On the other hand, learning different samples naively could greatly increase computing cost. In this paper, we learn a sample-aware policy efficiently by formulating it as reweighting problem. Specifically, an network takes transformation and corresponding augmented inputs, outputs weight adjust loss computed task network. At training stage, minimizes weighted losses of images, while on validation set via meta-learning. We theoretically prove convergence procedure further derive exact rate. Superior is achieved widely-used benchmarks including CIFAR-10/100, Omniglot, ImageNet.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Policy Aware Geospatial Data

Digital Rights Management (DRM) prevents end-users from using content in a manner inconsistent with its creator’s wishes. The license describing these use-conditions typically accompanies the content as its metadata. A resulting problem is that the license and the content can get separated and lose track of each other. The best metadata have two distinct qualities – they are created automatical...

متن کامل

Visual Data Augmentation through Learning

The rapid progress in machine learning methods has been empowered by i) huge datasets that have been collected and annotated, ii) improved engineering (e.g. data pre-processing/normalization). The existing datasets typically include several million samples, which constitutes their extension a colossal task. In addition, the state-ofthe-art data-driven methods demand a vast amount of data, hence...

متن کامل

Inefficiency of Data Augmentation for Large Sample Imbalanced Data

Many modern applications collect large sample size and highly imbalanced categorical data, with some categories being relatively rare. Bayesian hierarchical models are well motivated in such settings in providing an approach to borrow information to combat data sparsity, while quantifying uncertainty in estimation. However, a fundamental problem is scaling up posterior computation to massive sa...

متن کامل

Towards Policy-aware Queries over Linked Data

The Linked Data principles for publishing data on the Web enable the distributed evaluation of queries, where data sources are discovered during runtime. Data sources can have associated licenses that restrict allowed usages. Besides restrictions on the access to the data sources, the usage terms can also restrict the usage terms which can be assigned to derived data artefacts, e.g. query resul...

متن کامل

Semi - supervised Learning Methods for Data Augmentation

The original goal of this project was to investigate the extent to which data augmentation schemes based on semi-supervised learning algorithms can improve classification accuracy in supervised learning problems. The objectives included determining the appropriate algorithms, customising them for the purposes of this project and providing their Matlab implementations. These algorithms were to b...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i12.17324